On the combination of auditory and modulation frequency channels for ASR applications

Authors

  • Fabio Valente
  • Hynek Hermansky
Abstract

This paper investigates the combination of evidence from different frequency channels obtained by filtering the speech signal at different auditory and modulation frequencies. In our previous work [1], we showed that combining classifiers trained on different ranges of modulation frequencies is more effective when performed in a sequential (hierarchical) fashion. In this work we verify that combining classifiers trained on different ranges of auditory frequencies is more effective when performed in a parallel fashion. Furthermore, we propose a neural-network-based architecture for combining evidence from different auditory-modulation frequency sub-bands that takes advantage of these findings. It reduces the final WER by 6.2% absolute (from 45.8% to 39.6%) with respect to the single-classifier approach on an LVCSR task.
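The two combination schemes named in the abstract can be illustrated with a minimal data-flow sketch. It is not the authors' implementation: the abstract does not specify the network details, so the wiring below is one plausible reading. The helper names (`make_classifier`, `parallel_combination`, `hierarchical_combination`), the feature sizes, and the untrained linear-softmax stand-ins for trained neural networks are all assumptions made for illustration.

```python
import math
import random


def softmax(scores):
    """Turn raw scores into a probability vector."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def make_classifier(n_in, n_classes, rng):
    """Hypothetical stand-in: an untrained linear-softmax layer, not a trained network."""
    weights = [[rng.gauss(0.0, 0.1) for _ in range(n_in)] for _ in range(n_classes)]

    def classify(x):
        return softmax([sum(w * v for w, v in zip(row, x)) for row in weights])

    return classify


def parallel_combination(x_a, x_b, n_classes, rng):
    """Assumed parallel scheme: one classifier per band, outputs merged by a third."""
    clf_a = make_classifier(len(x_a), n_classes, rng)
    clf_b = make_classifier(len(x_b), n_classes, rng)
    merger = make_classifier(2 * n_classes, n_classes, rng)
    return merger(clf_a(x_a) + clf_b(x_b))  # list concatenation of both posteriors


def hierarchical_combination(x_a, x_b, n_classes, rng):
    """Assumed sequential scheme: first band's posteriors feed the second classifier."""
    clf_a = make_classifier(len(x_a), n_classes, rng)
    clf_b = make_classifier(n_classes + len(x_b), n_classes, rng)
    return clf_b(clf_a(x_a) + x_b)  # posteriors concatenated with second band's features


rng = random.Random(0)
n_classes = 4  # hypothetical number of phone classes
x_a = [rng.gauss(0.0, 1.0) for _ in range(6)]  # features from one frequency band
x_b = [rng.gauss(0.0, 1.0) for _ in range(6)]  # features from another band

par = parallel_combination(x_a, x_b, n_classes, rng)
hier = hierarchical_combination(x_a, x_b, n_classes, rng)
print(len(par), len(hier))
```

Both functions return one posterior vector per input frame. The only difference is where the second band enters: alongside the first band in the parallel case, and after the first classifier in the hierarchical case.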

Similar resources

Using WPT as a New Method Instead of FFT for Improving the Performance of OFDM Modulation

Orthogonal frequency division multiplexing (OFDM) is used in many modern communication systems to provide immunity against very hostile multipath channels. The OFDM technique divides the total available frequency bandwidth into several narrow bands. In conventional OFDM, the FFT algorithm is used to provide orthogonal subcarriers. Intersymbol interference (ISI) and intercarrier interferen...

Evaluation Performance of OFDM Multicarrier Modulation over Rayleigh and Rician Standard Channels Using WPT-OFDM Modulations

In recent years, Wavelet Packet Modulation (WPM), or Wavelet Packet Transform based Orthogonal Frequency Division Multiplexing (WPT-OFDM), has been introduced to wired and wireless communications as an efficient Multicarrier Modulation (MCM) technique. Wavelets have attractive features such as flexibility, compatibility, and localization in both the time and frequency domains with no need to use ...

O23: Modulation of Pacemaker Channels and Rhythmic Thalamic Activity by Demyelination and Inflammatory Cytokines

The thalamus is a central element for the generation of rhythmic oscillatory activity under physiological and pathophysiological conditions. In particular, slow oscillations in the delta and theta frequency bands, which normally occur during slow-wave sleep, are associated with a number of neuropsychiatric conditions if they occur during wakefulness and may be the basis for the generation of character...

An Efficient Hierarchical Modulation based Orthogonal Frequency Division Multiplexing Transmission Scheme for Digital Video Broadcasting

Due to the increasing number of users, efficient use of spectrum plays an important role in digital terrestrial television networks. In digital video broadcasting, local and global content are transmitted by single-frequency and multifrequency networks, respectively. Multifrequency networks support transmission of global content and consume a large amount of spectrum. Similarly local content are well ...

Robust speech recognition using the modulation spectrogram

The performance of present-day automatic speech recognition (ASR) systems is seriously compromised by levels of acoustic interference (such as additive noise and room reverberation) representative of real-world speaking conditions. Studies on the perception of speech by human listeners suggest that recognizer robustness might be improved by focusing on temporal structure in the speech signal th...

Publication date: 2008